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The conceptual setting of quantum mechanics is subject to an ongoing debate from its beginnings 
until now. The consequences of the apparent differences between quantum statistics and classical 
statistics range from the philosophical interpretations to practical issues as quantum computing. 
In this note we demonstrate how quantum mechanics can emerge from classical statistical systems. 
We discuss conditions and circumstances for this to happen. Quantum systems describe isolated 
subsystems of classical statistical systems with infinitely many states. While infinitely many classical 
observables "measure" properties of the subsystem and its environment, the state of the subsystem 
can be characterized by the expectation values of only a few probabilistic observables. They define 
a density matrix, and all the usual laws of quantum mechanics follow. No concepts beyond classical 
statistics are needed for quantum physics - the difi'erences are only apparent and result from the 
particularities of those classical statistical systems which admit a quantum mechanical description. 
In particular, we show how the non-commuting properties of quantum operators are associated to 
the use of conditional probabilities within the classical system, and how a unitary time evolution 
reflects the isolation of the subsystem. 



A realization of quantum mechanics as a classical statis- 
tical system may shed new light on the conceptual inter- 
pretation of experiments based on entanglement, as tele- 
portation or quantum cryptography One may even 
speculate that steps in quantum computing could be 
realized by exploiting classical statistics. Recently, classi- 
cal statistical ensembles that are equivalent to four-state 
and two-state quantum mechanics have been constructed 
explicitely Q • This constitutes a proof of equivalence 
of few-state quantum statistics and classical statistics for 
infinitely many degrees of freedom. In view of the particu- 
lar manifolds of classical states for these examples one may 
wonder, however, if quantum statistical systems are very 
special cases of classical statistics, or if they arise rather 
genuinely under certain conditions. In this note we argue 
that quantum statistics can indeed emerge rather generally 
if one describes small "isolated" subsystems of classical en- 
sembles with an infinity of states. 

An atom in quantum mechanics is an isolated system 
with a few degrees of freedom. This contrasts with quan- 
tum field theory, where an atom is described as a particular 
excitation of the vacuum. The vacuum in quantum field 
theory is a complicated object, involving infinitely many 
degrees of freedom. In a fundamental theory of particle 
physics, which underlies the description of atoms, collec- 
tive effects as spontaneous symmetry breaking are crucial 
for its understanding. Our treatment of atoms in the con- 
text of classical statistics is similar to the conceptual set- 
ting of quantum field theory. A classical statistical system 
with infinitely many states describes the atom and its en- 
vironment or the atom and the vacuum. The quantum 
statistical features become apparent if one concentrates on 
a subsystem that describes the isolated atom. 

Our classical statistical description of quantum systems 
has three crucial ingredients. (1) Probabilistic observables 
have in a given quantum state only a probabilistic distri- 
bution of possible measurement values, rather than a fixed 
value as for classical observables in a given state of the 



classical ensemble. Probabilistic observables obtain from 
classical observables by "integrating out" the environment 
degrees of freedom. The fact that this map is not invertible 
avoids conflicts with the Kochen-Specker theorem 0, 0]. 
The probabilistic nature of the observables can be under- 
stood as a result of "coarse graining of the information" , 
starting from deterministic observables on a suitable level. 
Alternatively, this concept may be used as a basic defini- 
tion of observables 0] . 

(2) Conditional probabilities are used for a computation 
of the probabilities for the possible outcomes of two mea- 
surements of observables A and B. In particular, if B is 
measured after A, the outcome of the measurement of B 
depends on the previous measurement of A. Conditional 
probabilities induce the concept of quantum correlations, 
and we propose that quantum correlations rather than clas- 
sical correlations should be used for a description of two 
measurements. This avoids conflicts with Bell's theorem 
[§, [13] ■ The classical correlations are actually not uniquely 
defined for the quantum system - they depend on proper- 
ties of the environment. 

(3) The unitary time evolution of quantum mechanics is a 
special case of a more general classical evolution which can 
also describe the phenomena of decoherence and syncoher- 
ence. We propose that the unitary time evolution reflects 
the isolation of the subsystem and corresponds to a partial 
flxed point (or better "fixed manifold" ) of the more general 
classical time evolution. 

A classical statistical description of quantum mechanics 
should not be confounded with a deterministic description. 
The no go theorems for large classes of local deterministic 
"hidden variable theories" remain valid. We rather take 
the attitude that a probabilistic approach is appropriate 
for the basic setting of both the quantum and the classical 
world. Deterministic behavior is a particular case (albeit 
rather genuine) which originates from the collective prop- 
erties of many degrees of freedom, as the laws of thermo- 
dynamics, or the motion of planets which are composed of 



2 



many atoms. 

Probabilistic observables 

The most crucial effect of the embedding of the sub- 
system into classical statistics for infinitely many states is 
the appearance of probabilistic observables for the descrip- 
tion of the subsystem. For a given state of the subsys- 
tem - which will be associated with a quantum state - they 
have a probability distribution of values rather than a fixed 
value as for the standard classical observables. For subsys- 
tems that are equivalent to M-state quantum mechanics 
the spectrum of the possible outcomes of measurements for 
the probabilistic observables contains at most M different 
real values 7q. In a given quantum state the probabilistic 
observable is characterized by probabilities Wa to find 
where Wa depends on the state. The simplest example are 
two-level observables, which can resolve only one bit, such 
that 7i = 1 , 72 = —1. 

Let us consider a subsystem that can be described by n 
discrete classical two-level-observables A^''\ The subsys- 
tem is embedded into a classical statistical ensemble with 
infinitely many states labeled by r. On this level the ob- 
servables A^'^^ are standard classical or deterministic ob- 

(k) 

servables. They can only take fixed values A)- = ±1 for 
any state r of the classical statistical ensemble. We as- 
sume that these observables form a basis in a sense to be 
specified later. The simplest quantum mechanical analogue 
for n = 3 is two-state quantum mechanics, with A^''^ cor- 
responding to three orthogonal "spins" in an appropriate 
normalization. This may be viewed as an atom with spin 
one half where only the spin degree of freedom is resolved, 
as for example in Stern-Gerlach type experiments. For ar- 
bitrary n, we denote the expectation value or average of 
A'^''^ by pk, 



Pk 



(AW> -^P.AW , -l<p, <1. 



(1) 



Here we have formulated A'^^^ as classical observables 
with classical probabilities Pr > for the states r of the 
classical statistical ensemble. As usual, one has X^tP^ = ^■ 
(Not all classical probability distributions {pr} correspond 
to quantum systems, and we will discuss restrictions be- 
low.) The probabilistic observables associated to these 
classical observables have probabilities w^^^ = (1 ± p/£)/2 

(k) 

to find 7i_2 = il- We can compute by summing the 

ik\ 

probabilities Pt over all classical states for which Ar — 1, 



and similar for 



This maps the classical observable, 

(k) 

as characterized by the values Ar in every classical state 

T, to a probabilistic observable characterized by 7a and 

(fe) 

The map from the classical observables to the proba- 
bilistic observables is not invertible. A different classical 
observable A''*"'' , with different values A'r''^ in the classical 
states, may be mapped to the same probabilistic observ- 
able A^'^'^ . For this it is sufficient that the probabilities w'"^'^ 
are the same for every probability distribution {pt} which 
corresponds to a quantum system. This lack of invertibility 



avoids confiicts of our classical statistical description with 
the Kochen-Specker theorem as we we will discuss at 

the end of this note. Since the classical observables ^r*^'' 
contain much more information than the associated proba- 
bihstic observables, they "measure" properties of both the 
quantum system and its environment. The transition to 
probabilistic observables "integrates out" the environment 
degrees of freedom 3] ■ 

Quantum system and system observables 

We will assume that the n numbers pk are the only in- 
formation that is needed for a computation of expecta- 
tion values for the "system observables" of the subsystem. 
In this sense, the state of the subsystem is characterized 
by the n expectation values of the basis observables A^'^^. 
Only a very limited amount of the information contained 
in the probability distribution {pr} for the total system is 
needed for the subsystem. Quantum mechanics is "incom- 
plete statistics" in the sense of ref. [ll|- More precisely, 
system observables are those classical observables that lead 
to probabilistic observables A for which the probabilities 
Wa can be computed in terms of {pfe}. Then the expecta- 
tion values of functions f{A) can also be computed from 
{pk}, if {A)) = J2afha)wa{pk)- Wc wiU assumc that the 
relation between Wa and pk is linear. 

Our first question concerns a classification of possible 
system observables for the subsystem. It is straightfor- 
ward to define rescaled observables cA^'^' by {cA'-'^'>)t- = 
= cpk- Furthermore, we can trivially shift 
the observable by a piece eg proportional to the unit ob- 
servable. The rescaled and shifted observables A obey 



{A)=Pke[^^+e[^ 



(2) 



where repeated indices are summed. Here we associate to 
each A'^^'^ an ri-dimensional unit vector e^^'^ with compo- 
nents e"m = S^. Then the vector e^"^-* reads e^"^-* = ce'^'"'^ if 
A = cA'^'^'K One may use c = h/2 if A describes a spin with 
standard units of angular momentum. Other units may 
be employed for alternative interpretations, as for exam- 
ple occupation number n = {1 + A'-^^)/2 which equals one 
for occupied and zero for empty. (Contrary to widespread 
belief, h is not a genuine property of quantum mechanics, 
but rather an issue of units.) In the following we mainly 

(A) 

discuss observables with Cq — 0. 

For an arbitrary system observable A we may write the 
expectation value (A) = J^a^a-'^^ipk), with Wa depending 
linearly on pk, as a linear combination {A) = J^k Ck{A^'^^). 
(If necessary, we substract an appropriate constant shift.) 
This should hold for all probability distributions {pr} 
which describe the quantum system. (Such observables 
may correspond to rotated spins in the quantum mechan- 
ical analogue.) We can associate to each such observable 



the vector e^^^ = CfcC 



(fc) M) 



Ck, such that eq. 



remains valid. Explicit constructions of such probabilistic 
observables can be found in The possibility to write 
the expectation value in the form (2) is a necessary, albeit 
not sufficient condition for a classical observable to be a 
system observable. 
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Probabilistic observables are characterized by the spec- 
trum {7a} of possible measurement values, and the as- 
sociated probabilities Wa- The multiplication of the ob- 
servable by a constant c and the addition of a piece pro- 
portional to the unit observable are always defined by 
7a c^a + eo- However, the sum and the product of 
two probabilistic observables A, B are, in general, not de- 
fined. If system observables exist for arbitrary real Ck this 
allows a definition of linear combinations of probabilistic 
system observables as represented by linear combinations 
of the associated vectors e. For C = caA + cbB one has 
e(c^) = CAe(^) + CBe(«) , (C) = ca{A) + cb{B). 

Next we are interested in some general properties of the 
basis observables. For example, one typical question may 
ask if two of them can have simultaneously a sharp value. 
A "classical eigenstate" of a probabilistic observable A is an 
ensemble for which A has a "sharp value" with vanishing 
dispersion, {A^) — (A)^ = 0. For example, the eigenstate of 
the observable A'^^^ with "classical eigenvalue" {A'^^'^) = 1 

is characterized hy Pr — Q whenever A^r'' = —1- The 
maximal number of sharp "basis observables" A'^^'^ can be 
characterized by the "purity" P of the ensemble, 

P ^ PkPk- (3) 

(Note that P depends on the set of basis observables that 
characterize the subsystem.) For P = one finds equipar- 
tition with (A) — for all two level observables. Obviously, 
M sharp observables require P > M, since at least for M 
values of k one needs pk = ±1. For an ensemble with P = 1 
at most one observable A'-''^ can be sharp. Typical classi- 
cal ensembles that describe isolated quantum systems will 
have a maximal purity smaller than n, such that not all 
can have sharp values simultaneously. We recall that 
the purity is a statistical property involving expecta- 
tion values. For a given classical state r all observables 
A^'^^ have a sharp value. 

Density matrix and wave function 

For M an integer obeying M > P + I we may represent 
the Pk by an M x M hermitean "density matrix" p^^: 

P=^il + PkLk). (4) 

The matrices Lfc 3 are 5'C/(M)-generators k = L.A'P - 1, 
obeying 

trife = , Ll = l , ti{LkLi) = MSki, (5) 

{Lk,Li} = 2Skl + 2dklmL,n , [Lk, Li] = 2ifklmL,n. 

The matrix p has indeed the properties of a density matrix, 
trp= 1 , p„„ >0 , trp' - ^(1 + P) < 1. (6) 

In analogy to quantum mechanics, a "classical pure state" 
obeys = p and therefore requires P — M — 1. We can 
also associate to any system observable A an operator A 
such that the quantum mechanical rule for the computation 
of expectation values holds (efe = e^f"^ ^ Ck) 

A = CkLk , {A) = pkCk = tr(pi). (7) 



We will concentrate on the minimal M needed for a given 
maximal purity of the ensemble. For n — — 1 the 
operators for the basis variables A'^^^ are given by L^. If 
n < A/P — 1 only part of the Lk are used as a basis for the 
observables. 

Many characteristic features of the system observables A 
can now be inferred from standard quantum mechanics, as 
demonstrated by a few examples. For M = 2 at most one 
of the three possible two-level-observables A^^^^ can have a 
sharp value. This occurs for an ensemble where the den- 
sity matrix describes a quantum mechanical pure state, 
p = i(l ± i), trp2 = 1 + itri2 = 1^ with iW = Lk = r^. 
For such an ensemble the expectation value of the two or- 
thogonal two-level-observables must vanish, (A^'^) = for 
I ^ k. Thus, whenever one basis observable is sharp, the 
two others have maximal uncertainty, as for the spin one- 
half system in quantum mechanics. 

Another example for M = 4 describes two different two- 
level-observables (say the z-direction of two spins Si, Si) 
by Li — diag (1,1,— 1,-1) and L2 — diag (1,-1,1,-1). 
The product of the two spins is represented by L3 = 
diag (1,-1,-1,1). Consider an ensemble characterized 
by p3 — —1 , pi — p2 ~ 0. For this ensemble one 
has = = such that for both two-level- 

observables the values +1 and —1 are randomly distributed 
in the ensemble. Nevertheless, [A^^'^) = — 1 indicates that 
the two "spins" are maximally anticorrelated. Whenever 
the first spin takes the value +1, the second one necessar- 
ily assumes —1 and vice versa. At the end of this note 
we discuss that pure states of this type show the charac- 
teristics of an entangled quantum state. Our third exam- 
ple considers the observable S corresponding to the sum 
S = Li + L2. For the particular pure state density ma- 
trices {pm)af3 = SmaSmf3 One has (S) = 2 (for m = 1), 
(S) = (for m = 2,3) and (S) = -2 (for m = 4). Thus 
S has the properties of a total spin, composed of two half 
integer spins (say Sz = Si + S^). 

The density matrix can be diagonalized by a unitary 
transformation. In consequence, any pure state density 
matrix can be written in the form p — U PmU'' for a suitable 
U, with UU'' = 1. This allows us to "take the root" of 
a pure state density matrix by introducing the quantum 
mechanical wave function ipa as an Af-component complex 
normalized vector, ip'^ip — 1, 

Pal3 = ^a^p , fpa = Uaf}{lpm)l3 , 

(^m);3 = , {A) = ^P^A^p. (8) 

All the usual rules for expectation values in quantum me- 
chanical pure states apply. 

Pure states play a special role since they describe classi- 
cal ensembles with minimal uncertainty for a given integer 
P. For M = 4 a pure state has purity P = 3 and three 
different observables can have sharp values, correspond- 
ing to the maximum number of three commuting quan- 
tum mechanical operators. For P > l,Af = P + 1 and 
{Lk, Li} = 2Ski + 2dkimLm the condition for a pure state, 
p2 = p or pk[pidkim - (M - 2)Skm] = 0, is not automati- 
cally obeyed for all pk with pkPk ~ P- For a pure state, the 
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"copurity" C = tr[(/5^ — pf'] must also vanish. While the 
purity P is conserved by all orthogonal SO(n) transforma- 
tions of the vector (pk), pure states are transformed into 
pure states only by the subgroup of SU{M) transforma- 
tions. The SU{M) transformations are realized as unitary 
transformations of the wave function tl), where the overall 
phase of remains unobservable since it does not affect p in 
eq. ([5]) . In our classical statistical description of quantum 
phenomena, the particular role of the classical pure states 
constitutes the basic origin for the unitary transformations 
in quantum mechanics. Just as in quantum mechanics, we 
can write the density matrix p for an arbitrary ensemble 
as a linear combination of appropriate pure state density 
matrices. 

Algebra of probabilistic quantum observables 

The vector e^^^* is sufficient for a determination of the 
expectation value of A in any state of the subsystem (char- 
acterized by Pk). However, the typical observables for 
the subsystem are probabilistic observables and we further 
have to specify the probability distribution for a possi- 
ble outcome of measurements for every state {pk\- This 
is needed for a computation of expectation values {A^) 
of powers of A. We will next concentrate on "quantum 
observables". They constitute a subclass of the system 
observables for which the operators associated to A^ are 
given by A^ . For a given M we consider observables 
with a spectrum of at most M different values 7a. We 
first consider a non-degenerate spectrum of M different 
7a and identify a — a. The probabilities to find 7q, in 
the state p^ of the subsystem are denoted by Wa{pk) > 

. Y.a^a{pk) = 1 , (A) = Y.a^a{pk)la = Pk^k- 

The expectation value {A) = iv[pA) is invariant under 
a change of basis by unitary transformations, p p' — 
UpU^ , A A! — UAU^. We may choose a basis with 
diagonal A' = diag{\i, Am) , {A) = Y,a P'aa^a, sug- 
gesting that the spectrum 7^ can be identified with the 
eigenvalues Aq, of the operator A, and Wa(pk) = p'aa- 

Among the classical observables we therefore consider 
the subclass of quantum observables which generate prob- 
abilistic observables with the property 

(^> ^^Wa,{pk)la , Wcipk) = (9) 

To each quantum observable, we can associate a hermitean 
quantum operator, with a spectrum of possible measure- 
ment values given by the eigenvalues A^ = 7q of the op- 
erator. The classical probability for the outcome of the 
measurement in a given state is the corresponding diag- 
onal element of the density matrix in a basis where A is 
diagonal. We describe the mathematical structures related 
to quantum observables in more detail at the end of this 
note, where we also deal with degenerate spectra of less 
than M different 7a. There we will construct an explicit 
realization for the quantum observables. 

We can now consider powers of the probabilistic ob- 
servable A, {AP) = Y.aWc{pk)ll- The observable A^ 
should belong to the observables of the subsystem, since 
it can be associated with p measurements of A, multi- 



plying the p measurement results that must be identi- 
cal. We can therefore associate an operator Ap to the 
observable A^ , {A^) — lY{pAp). For quantum observ- 
ables this is realized by Ap — A^ and we conclude {A^) — 
tv[pAP). For {A^) ^tripA^) a classical eigenstate of A 
obeys [tr(pA)]^ —ti:{pA^) and the possible classical eigen- 
values are the eigenvalues of the operator A. If a pure state 
is an eigenstate of A one has Aij: = AtA with A = Aq = 7q. 
one of the eigenvalues of A. 

Consider now the classical ensemble of states r which 
describe the subsystem together with its environment. It 
is straightforward to characterize the properties of the clas- 
sical observables At that are mapped to the probabilistic 
quantum observables. First, the spectrum of possible out- 
comes of individual measurements equals the (sharpe) val- 
ues of the classical observable Ar in the classical states r. 
It consists of the eigenvalues of A. Second, the probabili- 
ties for the states p-r must be distributed such that Wa can 
be written as a linear combination of pk {A'^'^^ ) for all states 
with P < M — \. At first sight this second requirement 
for a classical observable to be a quantum observable may 
seem rather special. However, in many circumstances the 
quantum observables are related to the basis observables 
by simple "physical operations" . For example, the spin 
observable in an arbitrary direction obtains from the three 
basis observables A'^^\M — 2) by rotation, such that the 
relation ^ arises naturally [4]. Other simple operations 
are the addition and multiplication of "commuting observ- 
ables" , as we will explain below. At this point it may be 
worthwhile to pause. We have selected a set of classical 
observables with a remarkable property: whenever the en- 
semble obeys a simple constraint on the purity, P < M ~1, 
all the laws of quantum mechanics apply for these classical 
observables, as for example Heisenberg's uncertainty rela- 
tion based on the commutator of the associated operators. 

In summary, we define the quantum observables as a sub- 
class of the classical observables with a spectrum of at most 
M different possible measurement values 7a. Furthermore, 
the associated probabilities Wa should be given by the re- 
lation ^ which is linear in pk, with coefficients depending 
only on the observable. This implies the relation Q and 
defines e^"^"* . (We may add a piece proportional to the unit 
observable.) It is then also possible to compute the expec- 
tation values (AP) in terms of the information contained 
in the quantum state through an expression linear in the 
Pk- Therefore (A^) does not involve any properties of the 
environment - these expectation values are completely de- 
termined by the subsystem. To any quantum observable we 
can associate a unique vector (e^) and therefore a unique 
quantum operator. In turn, to each quantum operator we 
can also associate a unique probabilistic observable, with 
Wa given by eq. On the level of probabilistic ob- 

servables the quantum observables can therefore be fully 
characterized by the vector (e^). 

The correspondence between classical probabilistic ob- 
servables and quantum operators allows for the introduc- 
tion of a "quantum product" AB between the probabilistic 
observables, which is associated to the operator product 
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AB. Together with linear combinations, this defines an 
algebra for the probabilistic quantum observables. The al- 
gebra of probabilistic quantum observables is isomorphic to 
the algebra of quantum operators. For both, it can be ex- 
pressed as operations in the space of (efc). In particular, we 
observe — AA. We will see below that a particular use 
of the "quantum product" AB arises from an investigation 
of conditional probabilities. 

In general, a probabilistic quantum observable A de- 
scribes an equivalence class of classical observables that 
all lead to the same quantum operator A. The product 
AB then induces a product structure between equivalence 
classes. If a representative for each class is selected one 
can also define AB on the level of classical observables [llj . 
We emphasize that the product AB is not the classical or 
pointwise product A ■ B where [A ■ B)r = AtBt- 

Quantum correlations 

Beyond a rule for the computation of expectation val- 
ues of observables, any theory must provide a prediction 
for the outcome of two consecutive measurements. After a 
first measurement of the observable A the result of a sub- 
sequent measurement of another observable B is, in gen- 
eral, influenced by the first measurement. The measure- 
ment of A has changed the ensemble and the knowledge of 
the observer. For simplicity we concentrate on two-level- 
observables, {A^) = (B^) = 1, = B^ = 1. The proba- 
bility of finding B — 1 after a measurement A — 1 amounts 
to the conditional probability (w^)^. After the measure- 
ment A = I the ensemble must be an eigenstate to the 
eigenvalue A = 1 - otherwise a subsequent measurement of 
A would not necessarily yield the same value as the first 
one. Then B is measured under this condition. 

We take here the attitude that there is only one given re- 
ality, but physicists can at best give a statistical description 
of it. The "fundamental laws" are genuinely of a statisti- 
cal nature and only establish relations within different 
possibilities for the history of the real world. Measuring 
for an observable ^ in a given state the value 7a simply 
eliminates the other possible alternatives (which may have 
nonvanishing probabilities Wa^a)- After the measurement 
of A it makes only sense to ask what are the outcomes of 
other measurements under the condition that A has been 
measured to have the value 7a. 

Correspondingly, the measurement correlation or condi- 
tional correlation {BA)m multiplies the measured values of 
A and B, weighed with the probabilities that they occur 

(BA)^ = «)^<,-(^«^)^<, 

-K)^<, + (^^)^<„ (10) 

with g the probability that A is measured as ±1 in the 
state s and {A) — w^ ,, — w^ ,, — tr {pA), + ^ = 1, 
(w!^)^ + (w^?)^ = 1. The conditional correlation needs 
a specification of the conditional probabilities as (w^.)^. 
For their computation we use the prescription that after 
the first measurement A = I the density matrix pA+ must 
describe an eigenstate of A, tr{ApA+) = 1. This effect of 
a first measurement may be called state reduction. The 



subsequent measurement of B then involves this state, 

iw^)i-iw^)i=triBpA+). (11) 

The relation (fTTI) is based on the property that the quan- 
tum observable B obeys eq. ([2]) for an arbitrary quan- 
tum state of the subsystem. Our assumption is therefore 
that after the measurement of A the classical ensemble still 
describes a quantum system. This seems reasonable for 
appropriate measurements since otherwise the first mea- 
surement destroys the isolation of the subsystem instead 
of only changing its state. This assumption has far reach- 
ing consequences, however. It necessarily implies eq. (|lip 
and excludes the option of using the classical correlation 
for the general description of subsequent measurements, as 
we will see below. 

For M = 2 the matrix pA+ is unique, pA+ — 5(1+^)1 
such that 

i^±)i = l± Jtr(M) , iw^)^ = i T ^tr(M). (12) 

However, for M > 2 one has ti{ApA+) = 1 for 

PA+ = jjil + A + X), tr(ix) = 0, 

trX = , trX^ = M{P - 1), (13) 

with 

- PA+ = ^iX' + {A X}) - (1 - PA+- (14) 

A necessary condition for pA+ describing a pure state is 
trX2 = M{M-2) , P = M-1, which imphes X ^0 only 
for M = 2. 

We may distinguish between a "maximally destructive 
measurement" where all information about the original en- 
semble except for the value of A is lost, and a "minimally 
destructive measurement" for which an original pure state 
remains a pure state after the measurement. A maximally 
destructive measurement is described by X = in eq. p^ . 
leading to 

{B)a+ = {wf)i - {w^)i = ^tr(Bi) = (i?A).„a,. (15) 

Here we denote by (i?^)inax the conditional correlation 
for maximally destructive measurements and use that pA- 
obtains from pA+ by changing the sign of A in eq. 
(with X = 0). We can use {BA)^^^^ for the definition of 
a scalar product between the observables B and A, since 
it does not depend on the initial ensemble. The two-level 
observables A^'^^ form an orthogonal basis in this sense, 

(aC^)^^)^^^ = 4,. 

A minimally destructive measurement oi A — 1 projects 
out all states with A = —1, without further changes of the 
original ensemble and associated density matrix p, 

PA± = ^.^l.^.A l±A)p{l±A). (16) 
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With 

{BA)^ - tr(BpA+)w+,, - tY{BpA-)w^^,, (17) 

and 

<, = (l±(A))/2 (18) 

this yields 

{BA)„,^hr{{A,B}p). (19) 

The conditional correlation for minimally destructive mea- 
surements in the classical statistical ensemble corresponds 
precisely to the expression of this correlation in quantum 
mechanics. It involves the anticommutator and is there- 
fore related to the quantum mechanical operator product. 
On the level of probabilistic observables we can express the 
conditional correlation (BA) in terms of expectation values 
of the quantum product AB 

{BAU = liiBA) + {AB)), (20) 

demonstrating the close connection between the quantum 
product and the conditional correlation. 

The two point correlation is commutative, {BA).fn = 
{AB),n- We will postulate that the two point correlation 
([20)1 describes in general the correlation between two mea- 
surements for quantum systems and call it quantum cor- 
relation^^ . We have motivated its use by two subsequent 
measurements, but the order of the measurements does 
actually not matter. Any rule for the computation of a 
correlation between two measurements needs some choice 
of an appropriate product of observables. At first sight, a 
possible alternative choice may be the classical correlation 
which is based on the classical product A - B, as defined on 
the level of the classical ensemble, {A ■ B)t = ArBr- We 
will see below, however, that A ■ B is usually not a quan- 
tum observable and can therefore not be determined from 
the information characterizing a quantum state, i.e. {pk}- 
Its use would therefore need information which relates to 
the environment, but not only to the subsystem. In other 
words, the use of the classical product corresponds to a 
state reduction after the first measurement where substan- 
tial information about the relation between the subsystem 
and the environment is retained. This is not what a usual 
measurement in a quantum system does and the classical 
correlation can therefore not serve for the description of 
such measurements. For any measurement where the out- 
come (including the state reduction) can be expressed in 
terms of information available for the subsystem the choice 
of the quantum product seems natural. It retains a max- 
imum of the information which is available in the subsys- 
tem. 

Since the correct choice of the correlation for a descrip- 
tion of two measurements is crucial we may describe the 
issue in some more detail. In any statistical setting one 
should distinguish the probability w++ that the two level 
observables A and B are measured with values A = 1, B = 



1 from the probability p++ that they "have" the values 
A = B = 1 in the classical statistical ensemble. On the 
classical statistical level one can express 

P++ = \{l + {A) + {B) + {A-B)) (21) 
in terms of the "classical" correlation 

{A- B) ^J^Pr^rBr- (22) 

T 

The probability p^^ does not specify a priori the condi- 
tional information relating two subsequent measurements, 
which is necessary for In general, one needs a sep- 

arate prescription how w++ should be computed from the 
available statistical information. Only under particular cir- 
cumstances, one may be able to identify w++ with 
In other words, w++ = p++ is an additional basic as- 
sumption which does not hold true in general. This con- 
trasts to the case of a single measurement for A, where 
w+ = p+ = {1 + {A))/2 by definition. 

If we try the identification w++ = we run into se- 
vere problems. In general, the classical correlation {A ■ B) 
is not computable in the subsystem. It is a property of 
the system and its environment and cannot be obtained 
from the information characterizing the quantum state, i.e. 
from Pk ■ This problem is closely linked to the fact that the 
mapping from the classical observables described by Ar 

(A) 

to the probabilistic observables described by 7a = ±1 
and = w'^^ is not invertible. The observable Ar de- 
scribes properties of the subsystem and its environment, 
while the characterization of the environment is only lost 
on the level of the probabilistic observable A. The use 
= corresponds to an implicit definition of the 
conditional probability for two measurements where after 
the first measurement A = 1 all classical states r for which 
Ar = —1 are eliminated. This elimination process depends, 
however, on the particular observable Ar and therefore also 
refiects properties of the environment, not only of the sub- 
system. Two observables Ar and A'r, which lead to the 
same probabilistic observable A, produce, in general, dif- 
ferent classical products {A ■ B) ^ {A' ■ B) . (These issues 
are discussed in more detail at the end of this note.) 

If the state of an isolated subsystem can be described by 
{pk}, this information must also be sufficient for a predic- 
tion of the outcome of two measurements. The probability 
W++ must be computable in terms of {pk}- For this reason 
we employ the quantum correlation (AB) and postulate 

w++^^{l + {A) + {B) + {AB).^), (23) 

as advocated already before for subsequent measurements. 
(A different motivation for the use of the quantum corre- 
lation is given in [ll|.) Indeed, now w++ can be expressed 
in terms of pk 

= J (l + ei-)ef ) + p,[ei-) + ef ^ + d™.e(;f )ej-)]) 

(24) 
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The other probabihties etc. obtain from eqs. ([23)) . 

([M)) by appropriate changes of relative signs. 

The prescription for the probabihties of the outcome 
of two measurements influences strongly the statistical 
properties of correlations. For example, one may ask if 
a "hidden variable theory" is possible, where there ex- 
ist discrete functions A{v) — ±1 , B{v) = ±1 such that 
{AB)m — J dvp{v)A(v)B{v) with some probability distri- 
bution p{v). Just as in the quantum mechanics, this can 
be excluded by the use of Bell's inequalities In our 
classical statistical setting the correlation function or 
the probabilities for the outcome of two measurements are 
exactly the same as in quantum mechanics. On the other 
hand. Bell's inequalities apply to the classical correlation 
{A ■ B). Besides theoretical arguments we have therefore 
also experimental evidence that in general the classical cor- 
relation function should not be used for the description of 
the outcome of two measurements. 

Quantum time evolution 

We have seen how quantum structures can arise from the 
description of subsystems where the "state of the system" 
is described by n expectation values of "basis observables" . 
For P < n the appearance of "non-commuting structures" 
is mandatory. The question remains why such quantum 
systems are omnipresent in nature, in contrast to "com- 
muting structures" for P = n. The answer may be rooted 
in stability properties of the time evolution. 

Let us consider some continuous time evolution of the 
classical probability distribution {pr}- It relates the en- 
semble at time ^2 to the ensemble at some earlier time ii, 
and induces a transition from pk{ti) to pk{t2), 

Pr{t2) = Srp{t2,tl)pp{ti) , Pk{t2) = Skl{t2,tl)pi{ti). (25) 

We may decompose the transition matrix Ski into the prod- 
uct of an orthogonal matrix Ski , which preserves the length 
of the vector {pi . . . , p„) and therefore the purity, and a 
scaling d, Ski = Skid. For an infinitesimal evolution step 
this implies 

dtPkit) = Tkipi{t) + Dpk{t) , D ^ dtliid{t,h), 

Tki = -Tik^dtSk,n{t,h)Slra{t,h). (26) 

For a given maximal purity during the evolution, eq. (|26p 
can be rewritten as an equation for the density matrix p, 

dtPaP = — i[H,p\al3 + Ra/SfS (^P-jS - 'j^^J^^ 

+ D{p^p - ^S^p). (27) 

This corresponds to a split of the infinitesimal SO{n) trans- 
formation dpk — Tkipi into a unitary part corresponding to 
the subgroup SU{M) and represented by the hermitean 
Hamiltonian H = HkLk + Hq, and remaining rotations of 
SO{n)/SU{M) represented by R or fku 

Tki = —"^fkhnHm+Tkl, [Lk, Li] = 2ifklmL„i, 

Tki = -JTRa0-f5{Lk)l3a{Ll)jS- (28) 



In general, H, R and D may depend on pk- 

We are interested in possible partial fixed points of the 
evolution for which R = and D = 0, while H is in- 
dependent of Pk- (Partial fixed points of this type have 
been found explicitly in the classical time evolution of 
non-relativistic boson fields [i3|-) Then eq. ([27]) reduces 
to the linear von-Neumann equation for the density ma- 
trix. In case of a pure state density matrix this implies 
the Schrodinger equation idtip = Hip. One recovers the 
unitary time evolution of quantum mechanics. The more 
general evolution equation away from the "unitary partial 
fixed point" can describe "decoherence" as a decrease 
of purity for D < 0, or "syncoherence" as the approach to 
the pure state partial fixed point with increasing purity for 
D > 0. The latter typically accounts for a situation where 
the subsystem described by the observables A^*^' can ex- 
change energy with the environment. An example is the 
evolution from a mixed state of atoms in different energy 
states to a pure state of atoms in the ground state by virtue 
of radiative decay of the excited states. A static pure state 
density matrix obtains as usual as a solution of the quan- 
tum mechanical eigenvalue problem Hip = Ejip. We sug- 
gest that the omnipresence of quantum systems in nature 
is due to the existence of such partial fixed points which 
reflect the isolation of the subsystem. 

If H is independent of pk it can be considered as an ob- 
servable of the subsystem, H — HkLk + Hq, with fixed 
coefficients Hk,Ho. By Noether's theorem it is associated 
with the energy of the subsystem, where Ej denotes the 
possible energy eigenvalues. (If one wants to use standard 
energy units one replaces H H/h.) On the other hand, 
R and D account for the interactions of the subsystem 
with its environment. They vanish in the limit of "per- 
fect isolation" of the subsystem. If the interactions with 
the environment are strong enough, the subsystem is typ- 
ically not evolving towards the equipartition fixed point, 
Pafi — jjSafi, but rather towards a Boltzmann type density 
matrix p ^ exp[— /3(i?-f /j-iTVi)] (for conserved quantities Ni 
and chemical potentials pi), which may be close to a pure 
state density matrix if the temperature T = j3~^ is small 
as compared to the typical separation of the two lowest en- 
ergy eigenvalues Ej. In contrast, if the isolation from the 
environment becomes efficient fast enough, the subsystem 
follows subsequently its own unitary time evolution, as well 
known from quantum mechanics. 

At this point we may recapitulate what we have achieved. 
Starting form a classical statistical ensemble we have iden- 
tified a class of classical observables which have the same 
expectation values and admit the same algebra as the op- 
erators in a corresponding M-state quantum system. This 
holds provided the "purity condition" P < M— 1 is obeyed. 
Also the conditional correlations which describe measure- 
ments of two such observables are the same as in the quan- 
tum system. Furthermore, we have found the criteria for 
the time evolution of the classical probability distribution 
that ensure a unitary evolution of the density matrix for 
the corresponding quantum system. Obviously, these prop- 
erties are related to the specific subset of classical observ- 
ables that can describe the subsystem, in the sense that 
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no further information about the environment is needed 
for a prediction of the outcome of measurements in the 
subsystem. In the remaining part of this paper we wiU dis- 
cuss the properties of these specific quantum observables 
in some dctaiL 

Quantum observables 

The quantum structures for probabilistic observables dis- 
cussed so far do not need any specification of the represen- 
tation as classical observables. In eq. (jlj many classi- 
cal systems with different states r, classical probabilities 
Pr and classical values of the observable in a given state, 
A)- , may describe the same quantum properties of the 
subsystem. Nevertheless, the implementation of quantum 
observables implies certain restrictions on the possible clas- 
sical realizations. We will discuss those in the following, 
mainly for the purpose of conceptual foundations. 

Classical observables are maps from the set of probability 
distributions Q, with elements {pr} = [pi ■ ■ ■ ,Pz), to real 

numbers, f7^-^']R, with {p^} {A'-'^'^) = Y,rPrA^^\ (For 
simplicity we employ a language with a finite number z 
of classical states, which can be extended to an infinite 
set at the end in some specified limiting procedure.) We 
will restrict the discussion to those elements of f2 which 
correspond to "quantum states" , i.e. which obey the bound 
for the purity of the ensemble. In general, the classical 
observables A'^'-^'^ describe the system and its environment. 
(We use "system" for the (isolated) subsystem or quantum 
system from now on.) 

We are interested in the subclass of quantum observables 
^(Q) whose expectation values and quantum correlations 
can be computed in the system. Quantum observables are 
classical observables with special properties. First, for a 
M-state quantum system, a quantum observable has at 
most M different classical values Af^'^ = ja- (For a non- 
degenerate spectrum we can identify a = a — 1 . . . M. ) For 
any given quantum observable we can classify the classical 
states T according to the value of A^^^ , r = (7a,o'-y„), 
where for each given 7^ one typically has a large degener- 
acy of classical states, labeled by cr^^ . We can define the 
probability Wa for the occurrence of a possible measure- 
ment value 7a as 

As a crucial ingredient, Wa must be computable from the 
quantities which specify the quantum state, i.e. from the 
expectation values of the "basis observables" {A'-'''') — pk- 
We assume a linear relation 

Wa{pk) = ^ CakPk + CaO- (30) 
k 

The probabilities must be normalized, '^^'Wa{pk) = 1, for 
arbitrary pk, which implies the conditions 

^Cafc^O, ^CaO = l. (31) 



For quantum observables the coefficients Cak,Cao are re- 
stricted further since Wa{pk) must obey eq. For non- 
degenerate eigenvalues one needs 

Cafe = J^{ULkU'')aa , CqQ = , (32) 

for some suitable unitary matrix U. 

Thus quantum observables are defined by two properties: 
(i) the restriction of the spectrum to at most M different 
values, (ii) the "quantum determination" of probabilities 
WaiPk)- While (i) only involves a property of the classical 
observable, the second restriction (ii) depends on the se- 
lection of possible quantum states out of the most general 
probability distributions {pr} and on relations to the basis 
observables. For the specification of a quantum observable 
we need at least 7a and Cak,Cao- A quantum observable 
A^*^' has the important property that its classical product 
A(Q) . (defined by (A^O) • A^Q'))^ = (Ai^^)"^) is again 
a quantum observable, with spectrum (7^) and the same 
Wa as for This extends to higher polynomials and 

arbitrary functions /(A^'3))^ 

We can now associate to any classical quantum observ- 
able A^'^'> a probabilistic quantum observable, A''^) — > A, 
which is characterized by the spectrum of possible mea- 
surement values (7a) and the associated probabilities Wa- 
Only this information will be needed for a computation of 
expectation values ((A'^'^^)^) in a "quantum state" of the 
system, while the detailed form of {pr} is not relevant. On 
the level of classical observables the quantum observables 
are characterized by a distribution of values A'^'^ = 7^ 
for the classical states r. This distribution still contains 
much more information than the spectrum 7a and the as- 
sociated probabilities Wa- Therefore a'^^ still describes 
the system and partly the environment. Only on the level 
of probabilistic observables A the parts of Ar"^' relevant 
for the environment are projected out, such that A only 
"measures" properties of the system. 

We also can associate to every A^'^^ a quantum operator 
A by a map A^'^^ A. It is constructed from eq. ([7]) by 
observing 

(AiQ)) ^ Y.^a{ca., + Y,Ca.kPk)^e'^HY.e't'pk. 

a k k 

(A) \ - (A) \ ^ 

Go = 2^7QCa,0 , el ^ 2^JaCa,k, 

a a 

^ = ^0^^ + E ^''^ifc ' (^*'^^> - tr(ip). (33) 
fc 

We note that the map ([55]) is possible for arbitrary proba- 
bilistic system observables obeying eq. (pO|) . Without the 
restriction of the type (j32p for quantum observables, how- 
ever, {A^'^^y will, in general, not be mapped to A^. A 
simple example is the random two-level observable R, with 
71 = 1 > 72 = -1 , Cafe = , cio = C20 = 1/2. It is mapped 
to i = 0, while i?2 = 1. 

In turn, we have a map from the space of quantum op- 
erators O to the space of probabilistic observables V, since 
for every operator A the spectrum {7a} is defined, and the 
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probabilities Wa can be computed for all quantum states 
{pk} or density matrices p. The latter obtain from the 
diagonal elements {UpU^)aa, with U the unitary matrix 
used for the diagonalization of A. 

However, the map from the ensemble of classical quan- 
tum observables Q to the space of quantum operators O 
is not invertible. The classical observables A^'^'> involve a 
specification of A^'' for every classical state r, which is 
much more information than contained in the coefficients 
Ca.k , Cafi- We may encounter situations where a quan- 
tum observable B is mapped to an operator B, while also 
a function f{A) of a different quantum observable A is 
mapped to the same operator, f{A) = B. (Here f{A) is 
an operator valued function, while f{A) is based on the 
classical product A ■ A.) Such a situation does not imply 
an identification of the quantum observables, i.e. in gen- 
eral one has B ^ f{A). This lack of invertibility of the 
map Q — !■ O constitutes an important difference between 
our approach and many alternative attempts of a "clas- 
sical formulation of quantum mechanics" , which associate 
to each A a unique classical observable. For example, this 
is typically assumed for "hidden variable theories" . Also 
for the Kochen-Specker theorem the existence of a map 
A is a crucial hypothesis, which is not obeyed in 

our setting. 

On the level of classical observables we always can de- 
fine a linear combination, C = XaA^'^^ + \bB^^\ and the 
pointwise product, D = A^*^^ • i?''^', of two quantum ob- 
servables A^Q\B^Q\ where = A^A^^^ + XsB^^KOr 
A^^ Bt'^K However, in general neither C nor D are quan- 
tum observables. Consider the simplest case, M = 2, 
and the two basis observables A^-^^^ and A^^^ with spec- 
trum 7q^'' = ja"^ = ±1. A linear combination C ~ 
cos -dA^^^ + sin ■dA^'^^ has a spectrum 7^*^^ = ± cos ± sin 1). 
This observable has four different possible measurement 
values. It can therefore not be a quantum observable of 
the system with M = 2, even though (C) can be com- 
puted in terms of pi^2 = (A^^^'^^^). We conclude that 
the "rotated spin" , which corresponds to the operator 
A{'d) = cos'dA^^^ + sini9A(^\ has to be described by a 

quantum observable ^[^^^ that is again a two level observ- 
able with spectrum 7q, = ±1, rather than by a linear com- 
bination of A^^^ and A'^' of the type C. This necessity 
arises for each value of the angle and we have discussed 
in detail in Q that this needs a classical ensemble with 
infinitely many classical states r. The reader should note 
that one can define two types of linear combinations. On 
the level of classical observables one can define combina- 
tions of the type C, while on the level of operators or the 
associated quantum observables a natural definition is A{^) 
or A^'^\i!}). In general, a projection on the subsystem does 
not map C to A{'&) or the probabilistic observable A^'^^'d). 

The classical product D — A^^^ • A^^^ has a spectrum 
j^"^ = ±1. The condition (i) for a quantum observable 
is obeyed by D. However, the probability wi^^ for find- 
ing 7^^^ — 1 needs knowledge of the joint probability to 
find = 1,A(2) = 1 or = -1, A^^) = -1. This 



information cannot be extracted from pi and p2, which 
only yield the probabilities for finding A^^^^ = ±1 (namely 
= (1 ± pi)/2) or for finding A^^' = ±1 (namely 

Wj. = (1 ± pi)l2). Nor is it contained in the expecta- 
tion value p3 of the third basis variable for M = 2. We 
conclude that the classical observable D does not obey the 
condition (ii) for a quantum observable. 

Representation as classical observables 

After these general remarks we now present an explicit 
classical ensemble and quantum observables for a system 
with given M. We recall that the classical quantum observ- 
ables AS^^ which are mapped to a given A are not unique. 
Also the specification of the classical states r and the cor- 
responding construction of classical observables is not sup- 
posed to be unique. At the end all measurable information 
of the system can be expressed in terms of the expectation 
values of quantum operators, such that the details of the 
classical observables do not matter. Only the existence of 
the classical observables in a setting free of contradictions 
is therefore needed in order to demonstrate a realization of 
quantum mechanics as a classical statistical ensemble. 

It is sufficient to determine at least one classical quan- 
tum observable for every operator A, i.e. for every her- 
mitean M x M matrix. The quantum observable XA'-'^\ 
with A G IR, is mapped to the operator XA. We will there- 
fore restrict our discussion to operators with unit norm, 
say tr A^ = M . Also the addition of a part proportional 
to the unit observable translates for operators to the ad- 
dition of a corresponding piece proportional to the unit 
operator, A + c ^ A ~\- c. We can therefore restrict the 
discussion to traceless operators, trA = 0. We follow a 
simple construction principle. Consider first a single op- 
erator A with a spectrum of m{A) < M distinct eigenval- 
ues X^^^^{A). We associate to it rn{A) discrete classical 
states, labeled by a{A) = 1 . . .m{A). In these states the 
classical observable A'^'^'' , which is mapped to A, takes the 
values A^^^-, — ^a(A)(^)- ^'^'^ ^ second operator B 
with m{B) distinct eigenvalues X^f^g^ (i?). If this operator is 
"independent" we construct the direct product space with 
states T labeled by the double index r = (^a{A) , a{B)) , and 

Ai^^ = A^(^)(i), M'^^ = A^(^)(^). This is continued until 
all independent operators are included. As stated above, 
the resulting ensemble has infinitely many classical states 
T, since the number of independent operators is infinite. 
(A well defined sequence of subsequently included opera- 
tors induces a well defined limit process for the construction 
of the ensemble Our construction yields explicitly a 
classical quantum observable for every independent opera- 
tor. We recall that many further classical observables that 
do not obey the restrictions for quantum observables can 
be defined in the ensemble. 

Details of the construction will depend on the notion 
of independent operators. As a simple criterion we may 
call two operators independent if tr(y4 — B)^ > e, and 
take the limiting process e — *■ 0. Other more restrictive 
definitions of "independent" may be possible. Different 
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contradiction-free definitions of "independent" lead to dif- 
ferent classical realizations, which all result in the same 
quantum properties of the system. The explicit construc- 
tion above has demonstrated that such classical realiza- 
tions exist. Of course, there are also classical ensembles 
with "many more" states than those used in our explicit 
construction. If one is only interested in the quantum ob- 
servables the states f of such a larger ensemble can be 
mapped to the states r of the ensemble used in the con- 
struction by summing the probabilities of all states which 
have the same At for all quantum observables. 

Classical product for observables 

We next turn to the conditions under which linear 
combinations or classical products of two quantum ob- 
servables are again quantum observables. For C'^'^) — 
XaA^^^ + XbB^^^ the expectation value can always be 
expressed in terms of pk- We can therefore compute 



The condition (l37l) is nontrivial. On the classical level 



that A 



For 



A^e^^-I-Aije^^ such that (C) obeys eq.©. This al- 
lows for the construction of an operator C ([7]) which obeys 
for all p 

(C) = tr(Cp) = \a{A) + Xb(B) = tr[(AAi + XbB)p . 

(34) 

We therefore can identify C — XaA + XbB. At this step, 
however, the spectrum of possible measurement values for 
C does not necessarily coincide with the spectrum of eigen- 
values of C , which is a necessary condition for a quantum 
observable. On the classical level the spectrum of C con- 
sists of all linear combinations 



7r^=7;3)=AA7r)+A,7r 



(35) 



we can derive from {pr} the probabilities 
has the value and B takes the value 7^^^' 

quantum observables A^'^\B'^'^^ the probabilities 
Y.b'^{a,b),wl^^ = J2a'^{a,b) Can be computed from {pk}. 
In general, the information contained in {pk} will not be 
sufficient to determine 'W(^a,b), however. It will therefore of- 
ten not be possible to express (A-B) — J2a 6 7a^''7b^^^(a,6) 
in terms of {pk}- Then D = A ■ B cannot be a quantum 
observable. On the other hand, if a linear relation be- 
tween (D) and pk exists, (D) = e^^^ + e^^^'p^, we can 
write (D) = tr(_Dp) and eq. ^ implies D = ^{A,B}. 

If XaA'-'^^+XbB'^'^^ is a quantum observable for arbitrary 
Xa, Xb the associated operators A and B must commute, 
[A, B] — 0. In order to show this, we first consider the case 

where a given 7g'^'' = XaJ^^^^ + Xb^j^^ corresponds to a 
unique combination {a,b). If C*-*^^ is a quantum observ- 
able, there must exist probability distributions {pr} which 

(c) 

are an "eigenstate" for the "eigenvalue" 7g . This implies 
Wc = I , Wc=^c = or b) = 1 , W(a,6) = if a 7^ a or 
b and therefore = 1 , Wa^a — Q , — 1 , w^^f, = 0. 
We conclude that this state is also a simultaneous eigen- 
state of the observables A^'^'> and B^^\ with respective 
eigenvalues 7^^^ and 7^^^. In particular, we may consider 
a pure state "05 which is an eigenstate of C with eigenvalue 
7^*^^ It must obey A-tps = 7a'^VE , iPeA = 75^^!" , ^V'e = 
7j^Vc 1 — li^^^c- We choose a basis where C is di- 
agonal and V'e' = V'f = (1,0, . . .0). In this basis A and 
B must be block-diagonal, A^i = 7a'^^(5ai, Aip = ji'^^'Sip 
and similar for B. We can repeat this for other eigenvalues 

(C) 



states. Even if the number M^*^^ of different values 7c*^^ 
obeys M'^^^ < M as realized, for example, if M^^Uf^^ 



< 



(c) 

M, there is no guarantee that all 7c coincide with the 
eigenvalues of C. 

If C^"^-* is a quantum observable its spectrum must co- 
incide with the spectrum of C. Furthermore, the classical 
product C^*^^ •C''5) must also be a quantum observable and 
obey 

{C-C) = X\{A-A)+XI{B-B)+2XaXb{A-B) 

= tr{C^p) (36) 
= A^tr(iV) + A|tr(BV) + AAABtr({i, B}p^ . 

We find as a necessary condition that the classical product 
A ■ B must be computable in terms of pk 



for all possible pairs (a, 6) = c. It may be reduced by those 

7^"^^ for which the probability vanishes for all quantum of C. If for every eigenvalue j^^'^ of C the composition out 

of eigenvalues of A^*^' and B^'?) is unique, we can infer that 
A and B must commute. Indeed, in the basis where C is 
diagonal both A and B must be simultaneously diagonal, 
and therefore [A, B] = 0. 

IC) 

In presence of multiple possibilities of composing 7g 
from linear combinations of ji'^"' and 7^^' the discussion 
is more involved. This case appears, however, only for 
particular coefficients Xa,Xb- If C is a quantum observ- 
able for arbitrary Xa and A b such degenerate cases can be 
avoided such that A and B must commute. Indeed, con- 
sider the case of a "degenerate decomposition" for a par- 
ticular pair (Xa, Xb)- This occurs if there are two solutions 



XajL^^ + AB7bf ^ = 7c^^ with 



(B) 



(A-B) 



itr({i,B}p 



(37) 



\ (^) , \ (B) (C) 
AA7ai + Xb%^ -Tci 

= 7c^'' , 7af^^ 7^ 7^^"* . Performing an infinitesimal shift 
Xa ^ Xa + Sai while keeping Xb fixed, results in a separa- 

7c 1 



tion of 7cf' and 7cf -* , 7cf ■* 



This has to hold for arbitrary p. If any (nontrivial) lin- 
ear combination of A^'^^ and S'^'^^ is a quantum observable 
one concludes that A(*3) . ^(Q) 

must be a quantum observ- 
able with associated operator A ■ B ^ ^{A,B}. Similar 
restrictions arise for higher powers of C. 



Then jcf^ has a unique composition from 7!^^'' and jif ^- 
(We have discussed here the case of two-fold degeneracy 

where other eigenvalues of C are separated from 7cf ' , 7^^ "* 
by a finite distance. Higher degeneracies can be treated 
similarly.) 

Two classical quantum observables A^'^^ and B'^^^ are 
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called commuting observables" if arbitrary linear combi- 
nations AaA(<3) + XbB'^Q^ are also quantum observables. 
The operators A, 13 associated to a pair of commuting ob- 
servables must commute, [A, 13] = 0. Furthermore, the 
classical product of two commuting observables is a quan- 
tum observable. The associated operators and probabilistic 
observables are given by the chain of maps 

. b'Q) B} AB. (38) 

For commuting quantum observables the classical correla- 
tion is computable from the quantum system and equals 
the quantum correlation. 

We next consider general conditions for the classical 
product being a quantum observable, D^'^) — j{{Q) . 

If for all eigenvalues of D^'^^ the decompositions 7g^'' — 
la^^ll^'' are unique, the operators A and B must again 
commute. In this situation the quantum state specifies the 
probabilities Wa for the observable A to have the value 7!"^'' , 
the analogue for Wb, and in addition the joint probability 
Wc — W(^a.b) that a measurement of A yields ji"^"^ and a mea- 
surement B yields 7^ ' . Since the associated operators A 
and 13 commute we may choose a basis where both are di- 
agonal. The probability for the operator AB to take the 
value 7c^-' = with (a, b) = c, is given by the cor- 

responding diagonal element of the density matrix in this 
basis, {p')aa- This must equal Wc, and we conclude that for 
all quantum states Tr{Al3p) = Ec^c7c^^ = {D) =Tv{bp) 
and therefore eq. ([55]) applies. 

Inversely, we cannot infer that for every pair of quantum 
observables A^'^\ B^^\ for which the associated operators 
commute, 

[i(Q)^ij(Q)] = 0, the classical product A ■ B 
must be a quantum observable. There is simply no guar- 
antee that the joint probabilities w^a,b) fiiid an expression 
in terms of {pk}- As an upshot of this discussion we con- 
clude that the lack of a map O ^ Q leaves a lot of freedom 
in the choice and properties of the quantum observables. 
Generically, linear combinations and classical products of 
quantum observables are not quantum observables them- 
selves. 

Bit chains 

Finally, we discuss the special setting of "bit chains". 
The simplest bit chain is a set of three commuting two level 
observables which we take among the set of basis observ- 
ables A^'^'. For the example M = 4 we may consider the 
observables Ti,T2,T3 associated to the three commuting 
diagonal operators Li, L2, ^3- A bit chain arises if the ex- 
pectation values of two (or several) one-bit-observables as 
well as their products can be determined simultaneously 
in a quantum system. Suppose that the first bit corre- 
sponds to Ti, the second to T2. Each bit can take the 
two values -fl or —1. For a bit chain the classical prod- 
uct Ti ■ T2 is also a quantum observable, associated with 
Ta. From the expectation values (Ti) , (T2) , (T3) we can 
determine all probabilities for the four possibilities of val- 
ues (+,+) , (+, — ) , (—J—) and (—,-!-) for bits one and 
two Q. This can be easily generalized: for a bit chain of 



quantum observables with P members Tj , j = 1 . . . P, 
all mutual classical products are members of the bit chain, 
Ti-Tj = CijkTk{i ^ j), with djk = Cj,k = 1 for one particu- 
lar combination k) and zero otherwise. All associated 
operators Tj mutually commute, and TiTj = CijkTk , Tj — 
1. Also all linear combinations XiTi + XjTj are quantum 
observables, represented by the operators XiTi + XjTj. 

For given M the maximal number of members of a bit 
chain is P = M — 1 and we call such chains "complete bit 
chains" . This restriction follows simply from the maximal 
number of mutually commuting operators. The presence 
of a bound for P poses certain restrictions on the classi- 
cal realizations of bit chains associated to different sets of 
mutually commuting operators. 

As an example, consider the case M — 8. A possible 
complete three bit chain with seven members can be 
associated to the operators Ci (T3 (g) 1 (g) 1) , C2 

(l(gr3(g)l) , C3 (l(gl(gT3) , Ci (l(gT3(gT3) , C2 
(t3 (g) 1 (g) T3) , ^ (t3 g) T3 ® 1) , C -> {t3 g) T3 g) T3). 

For this "C-chain" one has CjCj — C = C1C2C3 for 
all j — 1,2,3 , Ci ■ C2 = C3. Alternative candidates 
for complete three bit chains are the "A-chain" where 
T3 is replaced by ri, or the "_B-chain" which obtains 
from the C-chain by the replacement T3 — > T2, Further 

candidates are the "F-chain" (Ci , ^2 , ^3 , A , A , -F3 , F) , 

"G-chain" (Ai, C2, vlg, Gi, (72, Gs, G) or "i?-chain" 

{Ai, A2,C3, Hi, H2, H3, H), with analogous multiplication 
structures given by the order of the elements in the 

list, i.e. P ^ (t3 (g Ti (g Ti) , G = (n (g T3 (g ti), 
H (ti g) Ti g) T3). Finally, we may consider a possi- 
ble candidate "Q-chain" {F,G, H,Qi,Q2-Q3,Q), with 

Q ^ — (t3 (g r3 (g T3). If all these sets of observables are 
simultaneously realized as bit chains, we run into contra- 
diction. From the Q-chain we conclude F ■ G ■ H = Q- In 
turn, from the P, G, H chains we infer Ci ■ A2 ■ A3 — p, 
Ai ■ C2 ■ A3 ^ G , Ai ■ A2 ■ C3 ^ H and therefore 
Q = Ci-A2-A3-Ai-C2-A3-Ai-A2-C3^Ci-C2-G3 = C. 

The operators associated to G and Q have opposite sign, 
however, showing the contradiction. 

This clearly demonstrates that not every set of two level 
observables for which the associated operators mutually 
commute can be a bit chain simultaneously. In our case 
this only poses a consistency condition for the possibilities 
of classical products of quantum observables being quan- 
tum observables themselves. If we had a map A —f 
with f{A) /(A^*?)) and /(AC^)) based on the classical 
product, one could show that for every pair of commuting 
operators A, B the map implies AB ^ A- B. The resulting 
contradiction is a proof of the Kochen-Specker-theorem [5| 
- actually the above chains of observables correspond pre- 
cisely to the elegant proof of this theorem by N. Straumann 

The observation that not all "candidate chains" 
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C, F, G, iJ, Q can be simultaneously bit chains does not 
mean that the associated sets of seven commuting op- 
erators are inequivalent. There is no problem to as- 
sociate to each such operator set a bit chain. Only 
the bit chain associated to the set Qi = (T3 (g) ri (g) 
Ti) , = iji ® n ® n) ^, Qs^ = (ti (g) Ti (g) 

T3) , Q2Q3 , Q3Q1 , Q1Q2 , Q1Q2Q3 = -{t3 (g T3 (g 

Ta) should be a new bit chain with quantum observ- 
ables {Qi,Q2,Q3 , Q2 ■ Q3 , Q3 ■ Qi , Qi ■ Q2 , Qi ■ 
Q2 ■ Q3) which are different from the candidate Q-chain 

{F,G, H,Qi,Q2,Q3,Q) discussed above. This does not 
lead to any contradiction, since the map from quantum 
observables to operators is not invertible. Both the ob- 
servables Qi and F are mapped to the same operator Qi, 
but the classical product may be a quantum observable for 

Qi • Q2 and not for F ■ Q2- In our explicit construction 
of a classical representation of observables we should not 
exclude Qi from the set of independent operators with the 
argument that it can be obtained as the product of two 
commuting observables. 

Classical entanglement 

Entanglement is a key feature of quantum mechanics. 
Its classical realization is best discussed in the context of 
bit chains. Consider M = 4 and a bit chain of observ- 
ables Ti,r2,r3, with corresponding commuting diagonal 
operators Li,L2,L3 = LiL2. We associate a first bit to 
Ti and a second bit to T2. The product of the two bits 
is then determined by the classical product Ti ■ T2 = T3. 
Let us concentrate on a state with = (T^) = — 1 , 
pi = (Ti) = , p2 = (T2) = 0. Depending on the 
other pk this may be a pure or mixed state, with purity 
P = 1 + J2k>4Pk- Fi'om pi = we infer an equal prob- 
ability to fin J for the first bit the values -1-1 and —1, and 
similar for the second bit from p2 = 0. On the other hand, 
Pa = — 1 implies a maximal anticorrelation between bits 
one and two. The probabilities vanish for all classical states 
for which both bit one and bit two have the same value cor- 
responding to 10++ = w__ = , w I- = w I- = 1/2. 

We may assume that a first apparatus measures bit one, 
and a second one bit two. Whenever the first apparatus 
shows a positive result, the second apparatus will neces- 
sarily indicate a negative result, and vice versa. By itself, 
this anticorrelation does not yet indicate an entangled pure 
state. For example, it may be realized by a mixed state 
with pfc = for fc > 4 , P = 1, corresponding to a diagonal 
density matrix p — {l/2)diag{0, 1, 1,0). 

We may compute the quantum correlation between a 
rotated spin observable A{d) with associated operator 
A{-d) = cosiJLi -I- siwdLg and a second rotated spin ob- 
servable B{ip) with B{ip) = cosipL2 + simpLi. One finds 
for arbitrary pk 

{Ai{t)B{ip)) = C(^,^) = itr({i(^),i?(^)}p) 

= cos I? cos (pp3 -I- cos i9 sin (/spe (39) 
-|- sin -d cos tppio + sin sin ippi2 , 



where we use the representations 

Li = (r3 ® 1) , L2 = (1 g) T3) , L3 = (t3 g) T3), 

Ls = (ti (g 1) , L4 = (1 (g Ti) , L12 = (ti (g n), (40) 

Le = (t3 (g Ti) , Lio = (ti (g T3) , Li4 = -(t2 (g T2). 

For all states with p^ = pi2 = — 1 , pe — Pio = 0, one 
obtains the familiar quantum result for two spins with rel- 
ative rotation 

{A{^)B{ip)) = - cos(i? - (ys) = - ip). (41) 

Bell's inequality for local deterministic theories reads for 
this situation 

\C{^i,0)-C{^2,0)\ <1 + C{'di,th). (42) 

Witheq. gl]) this reduces to |C'(i?i)- (7(152 )| <l + C{^i- 
■&2)- It is violated for i?i = tt/2 , i?2 = 7r/4. 

We observe that the contribution ^ pi2 to the quantum 
correlation matters. For our choice -di = tt/2 , ^2 = tt/A 
the inequality ^ reads |C(7r/2,0) - C(7r/4,0)| < 1 -I- 
C(7r/2, 7r/4). For pg = pg = , = — 1 and general 
P12 one finds C(7r/2,0) = , C(7r/4,0) -1/V2 and 
C(7r/2,7r/4) = pi2/\/2. For pi2 = Beh's inequality is 
now obeyed. 

We conclude that the presence of off-diagonal elements 
in the density matrix (in a direct product basis for the 
two entangled spins) plays an important role for the coex- 
istence of different complete bit chains. For our example 
with M = 4 a second bit chain besides Ti, r2, T3 = Ti • T2 
is given by Ts,T4, T12 = Tg-T^, with associated commuting 
operators Ls, L4, L12 — L^Lj^. Not only the spins in one 
direction are maximally anticorrelated for p3 = —1, but 
also the spins in an orthogonal direction (represented by 
LgjL^) are maximally anticorrelated for pi2 = —1. The 
quantum state of the subsystem allows for a specification 
of several correlations by independent elements as ps and 
P12. This possibility is closely connected to the use of quan- 
tum correlations for the calculation of the outcome of two 
measurements. In a setting where only the classical cor- 
relations are available, a simultaneous implementation of 
the two two-bit chains {Ti,T2,T3) and (T8,T4,Ti2) would 
require more than three mutually commuting objects and 
can therefore not be implemented for a state with purity 
P < 3. 

We finally display the classical formulation for two par- 
ticular entangled pure states. They are given by 

P3 = epi2 = -epi4 = -1 , e = ±1. (43) 

The sign e = +1 corresponds to the rotation invariant spin 
singlet state with density matrix 

p = i(l - (ri (g ri) - (t2 (g) T2) - {t3 (g T3)) , (44) 

and wave function 

1^ = 4s(V^2-^3). (45) 
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For e = — 1 the relative sign between ip2 and '03 is positive. 
Conclusions 

We have obtained all laws of quantum mechanics from 
classical statistics, including the concept of probability am- 
plitudes ip and the associated superposition of states with 
interference and entanglement, as well as the unitary time 
evolution. Our classical statistical description is genuinely 
probabilistic and not a local deterministic model. It al- 
lows to predict probabilities for the outcome of a chain of 
measurements, but not a deterministic result of a given 
measurement in terms of some "hidden variables" . Bell's 
inequalities do not apply for our formulation of a correla- 
tion function which is based on conditional probabilities for 
a sequence of measurements. Since the mapping from clas- 
sical observables to quantum operators is not invertible, no 
contradiction to the Kochen-Specker theorem arises. 

Our setting can be extended to include observables like 
location and momentum by considering many two-level ob- 
servables on a space-lattice and taking the limit of van- 
ishing lattice spacing. In our statistical mechanics set- 
ting quantum mechanics describes isolated subsystems of a 



larger ensemble that also includes the environment. Isola- 
tion does not mean that the subsystem can be described by 
classical probabilities for the states of the subsystem and 
sharp values of the observables in these states. It rather 
relates to a separated time evolution of the subsystem and 
to observables which can be described by quantities only 
associated to the subsystem, without explicit reference to 
the environment. 

We do not intend to enter here the debate if quantum 
mechanics or classical statistics are more fundamental -it 
is well known that classical statistics can be obtained as a 
limiting case of quantum mechanics. In our view classical 
statistics and quantum mechanics are two sides of the same 
medal. This may have far reaching consequences, as the 
possibility that the late time asymptotic state of a classical 
ensemble is given by the equilibrium ensemble of quantum 
statistics, or that certain steps in quantum computations 
can find a classical analogue. We find it remarkable that 
the conceptual foundations of quantum mechanics need not 
to go beyond the concepts of classical statistics. 
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